Meta-Interpretive Learning of Data Transformation Programs

نویسندگان

  • Andrew Cropper
  • Alireza Tamaddoni-Nezhad
  • Stephen Muggleton
چکیده

Data transformation involves the manual construction of large numbers of special-purpose programs. Although typically small, such programs can be complex, involving problem decomposition, recursion, and recognition of context. Building such programs is common in commercial and academic data analytic projects and can be labour intensive and expensive, making it a suitable candidate for machine learning. In this paper, we use the meta-interpretive learning framework (MIL) to learn recursive data transformation programs from small numbers of examples. MIL is well suited to this task because it supports problem decomposition through predicate invention, learning recursive programs, learning from few examples, and learning from only positive examples. We apply Metagol, a MIL implementation, to both semi-structured and unstructured data. We conduct experiments on three real-world datasets: medical patient records, XML mondial records, and natural language taken from ecological papers. The experimental results suggest that high levels of predictive accuracy can be achieved in these tasks from small numbers of training examples, especially when learning with recursion.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analysis of Effective Components of Educational Transformation in Agricultural Higher Education System in Iran

The purpose of the present study was to analyze the effective components of educational transformation in agricultural higher education system in Iran by a mixed study method. The statistical population includes all faculty members (N=361) teaching in agricultural college of Tehran, Tarbiat Modares and Shiraz University, and a sample of 186 faculty members (n=186) were selected by stratified ra...

متن کامل

Logical Vision: One-Shot Meta-Interpretive Learning from Real Images

Statistical machine learning is widely used in image classification. However, most techniques 1) require many images to achieve high accuracy and 2) do not provide support for reasoning below the level of classification, and so are unable to support secondary reasoning, such as the existence and position of light sources and other objects outside the image. In recent work an Inductive Logic Pro...

متن کامل

Standardizing clinical laboratory data for the development of transferable computer-based diagnostic programs.

The existence of systematic differences between test results obtained at different laboratories can compromise the development of generally accessible reference databases for interpretive pathology. We review approaches to the elimination of inter-laboratory bias from pathology test results through the use of standard unit transformations. A general transform procedure is described that will pe...

متن کامل

OpenFortran: Extending Fortran with Meta-programming

Meta-programming has shown much promise for improving the quality of software by offering programming language techniques to address issues of modularity, reusability, maintainability, and extensibility. A system that supports metaprogramming is able to generate or manipulate other programs to extend their behavior. This paper describes OpenFortran, a Meta-Object Protocol (MOP) that is able to ...

متن کامل

Learning Higher-Order Logic Programs through Abstraction and Invention

Many tasks in AI require the design of complex programs and representations, whether for programming robots, designing game-playing programs, or conducting textual or visual transformations. This paper explores a novel inductive logic programming approach to learn such programs from examples. To reduce the complexity of the learned programs, and thus the search for such a program, we introduce ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015